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Abstract 

One of the most intriguing facts about communication using quantum states is that these states cannot 
be used to transmit more classical bits than the number of qubits used, yet there are ways of conveying 
information with exponentially fewer qubits than possible classically [^/ . Moreover, these methods have 
a very simple structure — they involve little interaction between the communicating parties. We look more 
closely at the ways in which information encoded in quantum states may be manipulated, and consider 
the question as to whether every classical protocol may be transformed to a "simpler" quantum protocol of 
similar efficiency. By a simpler protocol, we mean a protocol that uses fewer message exchanges. We show 
that for any constant k, there is a problem such that its k + 1 message classical communication complexity 
is exponentially smaller than its k message quantum communication complexity, thus answering the above 
question in the negative. Our result builds on two primitives, local transitions in bi-partite states 
on previous work) and average encoding which may be of significance in other applications as well. 



1 Introduction 



A recurring theme in quantum information processing has been the idea of exploiting the exponential 
resources afforded by quantum states to encode information in very non-obvious ways. Perhaps the most 
representative result of this kind is due to Ambainis, Schulman, Ta-Shma, Vazirani and Wigderson [|[, 
which shows that it is possible to deal a random set of ^/N cards each from a set of A^ by the ex- 



change of 0(log A^) quantum bits between two players. Raz |21| gives a communication problem where 
the information storage capacity of quantum states is exploited more explicitly. Both are examples of 
problems for which exponentially fewer quantum bits are required to accomplish a communication task, 
as compared to classical bits. 

The protocols presented by also share the feature that they require minimal interaction between 

the communicating players. For example, in the first protocol, one player prepares a set of qubits in a 
certain state and sends half of them across as the message, after which both players measure their qubits 
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to obtain the result. On the other hand, efficient quantum protocols for problems such as checking 
set disjointness (DISJ) seem to require much more interaction: Buhrman, Cleve and Wigderson |Q] give 
an 0{^/N log N) qubit protocol for DISJ that takes 0(\/iV) message exchanges. This represents quadratic 
savings in communication cost, but also an unbounded increase in the number of messages exchanged 
(from one message to \/iV), as compared to classical protocols. Can we exploit the features of quantum 
communication and always reduce interaction while maintaining the same communication cost? In other 
words, do all efficient quantum protocols have the simple structure shared by those of ^ ? 

In this paper, we study the effect of interaction on the quantum communication complexity of problems. 
We show that for any constant k, allowing even one more message may lead to an exponential decrease 
in the communication complexity of a problem, thus answering the above question in the negative. More 
formally. 

Theorem 1.1 For any constant k, there is a problem such that any quantum protocol with only k mes- 
sages and constant probability of error requires n{N^^^''~^^^) communication qubits, whereas it can be 
solved with k + 1 messages by a deterministic protocol with 0(log N) bits. 



Klauck |11] states a relationship between the bounded message complexity of Pointer Jumping and DISJ. 
Together with our result, this implies an r2(A^^/'^('^'+^)) lower bound for k message protocols for DISJ, for 
any constant k. 

The role of interaction in classical communication is well-studied, especially in the context of the 
pointer jumping function ||l8|, 0, 19|. In fact, the problem we study in this paper is the subproblem 



of Pointer Jumping singled out in |15[| . Our analysis has the same gross structure as that in |T^] (also 
explained in [|l^]), but relies on entirely new ideas from quantum information theory. 

In the context of quantum communication, it was observed by Buhrman and de Wolf |5| (based on a 
lower bound of Nayak p^ ) that any one message quantum protocol for DISJ has linear communication 
complexity. Thus, allowing more interaction leads to a quadratic improvement in communication cost. 
The lower bound of ||l^ immediately implies a much stronger separation: it shows that the two message 
complexity of a problem may be exponentially smaller than its one message complexity (see also |11|). 
Our result subsumes all these. 

Our interest in the role of interaction in quantum communication also springs from the need to better 
understand the ways in which we can access and manipulate information encoded in quantum states. We 
develop information-theoretic techniques that expose some of the limitations of quantum communication. 
More specifically, we present a new primitive in quantum encoding, as suggested by the following theorem. 



Theorem 1.2 (Average encoding theorem) Let x ^ be a quantum encoding mapping m bit 
strings x € {0,1}"^ into mixed states ax- Let X be distributed uniformly over {0,1}'", let Q be the 
encoding of X according to this map, and let a = J2x ■ Then, 

X 

In other words, if an encoding Q is only weakly correlated to a random variable X, then the "average 
encoding" a (corresponding to a random string) is on average a good approximation of any encoded 
state. Thus, in certain situations, we may dispense with the encoding altogether, and use the single 
state a instead. 

We also use another primitive derived from the work of Lo and Chau [^] and Mayers [14| which 
combines results of Jozsa [^], and Fuchs and van de Graaf [^]. Consider two bi-partite pure states 
such that one party sharing the states cannot locally distinguish well between the two states with good 
probability. Then the other party can locally transform any of the states close to the other. 
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Theorem 1.3 (Local transition theorem) (based on ^1^ , I4, IC, ^) Let pi,p2 two mixed states 
with support in a Hilhert space 7i, IC any Hilbert space of dimension at least dim('H), and {(pi) any 
purifications of pi in 7i ® IC. Then, there is a local unitary transformation U on IC that maps \(j)2) 
to \(t)'2) = I ®U \(j)2) such that 

||l'^l)('^l|-|</'2)(</'2||lt < 2IIP1-P2IIJ. 

This may be of significance in cryptographic applications as well. 
2 Preliminaries 



2.1 The communication complexity model 

In the quantum communication complexity model Alice and Bob hold qubits. When the game 
starts Alice holds a superposition |x) and Bob holds \y) (representing the input to the two players), and 
so the initial joint state is simply \x) ®\y). The two parties then play in turns. Suppose it is Alice's turn 
to play. Alice can do an arbitrary unitary transformation on her qubits and then send one or more qubits 
to Bob. Sending qubits does not change the overall superposition, but rather changes the ownership of 
the qubits, allowing Bob to apply his next unitary transformation on the newly received qubits. At the 
end of the protocol, one player makes a measurement and declares that as the result of the protocol. 

In general, each player may also (partially) measure her qubits during her turn. However, we assume 
(by invoking the principle of safe storage [^) that all such measurements are postponed to the end. We 
also assume that the two players do not modify the qubits holding the input superposition during the 
protocol. Neither of these affects the aspect of communication we focus on in this paper. 

The complexity of a quantum (or classical) protocol is the number of qubits (respectively, bits) ex- 
changed between the two players. We say a protocol computes a function f : X xy ^ {0, 1} with e > 
error, if for any input x ^ X ,y ^ y the probability that the two players compute /(x, y) is at least 1 — e. 
Qeif) denotes the complexity of the best quantum protocol that computes / with at most e error. 

For a player P e {Alice, Bob}, Q'^'^if) denotes the complexity of the best quantum protocol that 
computes / with at most e error with only c messages, where the first message is sent by P. If the name 
of the player is omitted from the superscript, either player is allowed to start the protocol. 

We say a protocol V computes f with e error with respect to a distribution n on X x y, 

Prob(^^j,)g^^7,(P(x, y) = f{x, y)) > 1 - e. 

Qli^if) is complexity of computing / with at most e error with respect to /i, with only c messages 
where the first message is sent by player P. The following is immediate. 

Fact 2.1 For any distribution ji, number of messages c and player P , Q'jj^if) < Q^'^if)- 

2.2 Classical entropy and mutual information 

The Shannon entropy S{X) of a classical random variable X quantifies the amount of randomness in 
it. If X takes values x in some finite set with probability px, its Shannon entropy is defined as S{X) = 
— J2x ^ogpx- The mutual information I(X : y) of a pair of random variables X, Y is defined by I{X : 
Y) = S{X) + S{Y) — S{XY). It is a measure of how correlated the two random variables are. 
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The following are some basic facts about the mutual information function that we use in the paper. 
For any random variables X, Y, Z, 

I{X : YZ) = I{X : Y) + I{XY : Z) - I{Y : Z) (1) 
I{X : YZ) > I{X : Y). (2) 

Fano's inequality states that if Y can predict another random variable X with an advantage, then X 
and Y have large mutual information. We use it only in the following simple form. 

Fact 2.2 (Fano's inequality) Let X be a uniformly distributed boolean random variable, and let Y be 
a boolean random variable such that Prob(X = Y) > ^ + 5, where 6 > 0. Then I{X :Y) >! — + 6) . 

For other equivalent definitions and properties of these concepts, we refer the reader to a standard 
text (such as |^) on information theory. Finally, we give a simple bound on the deviation of the binary 
entropy function H{p) from 1 as p deviates from 1/2. 

Fact2.3 For 6 we have H{1 + 6) < 1-6'^. 

Proof: Prom the definition of the binary entropy function, we have 

H{^ + 5) = l-l[(l + 25)log(l + 25) + (l-2,5)log(l-25)]. 

2 3 4 

Using the expansion ln(l + x) = x — %- + ^ — ^ + ' ' ' for < 1, and simplifying, we get 



+ = l-(loge) 
which is the claimed bound. ■ 



2-2 \ 3 2-4/ \ 5 2-6 



2.3 The density matrix and the trace norm 



The quantum mechanical analogue of a random variable is a probability distribution over superposi- 
tions, also called a mixed state. Consider the mixed state X = {pi, \4>i)}, where the superposition \(f)i) is 
drawn with probability Pi . The density matrix of the mixed state X is px = ^iPi\<Pi){4'i\- The following 
properties of density matrices are immediate from the definition: every density matrix p is Hermitian, 
i.e., p = p\ has unit trace, i.e., Tt{p) = J2iP{hi) = !> and is positive semi-definite, i.e., (V'l p\il^) >0 for 
all {i/j). Thus, every density matrix is unitarily diagonalizable and has non-negative real eigenvalues that 
sum up to 1. 

Given a quantum system in a mixed state with density matrix p and a (general) measurement O on 
it, let p'^ denote the classical distribution on the possible results that we get by measuring p according 
to O. Suppose that it is some classical distribution pi, . . . where we get result i with probability pi. 
Given two different mixed states, we can ask how well one can distinguish between the two mixtures, 
or equivalently, how different the distributions resulting from a measurement may be. To quantify this, 
we consider the £i metric: if p = (pi,...,^^) and q = {qi, ■ ■ ■ ,qk) are two probability distributions 
over {1, . . . , k}, then the ii distance between them is || p — g = Z^i \Pi ~ Qi\- A fundamental theorem 
about distinguishing density matrices (see ||T[) tells us: 
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Theorem 2.4 Let pi,p2 be two density matrices on the same space 7i. Then for any (general) measure- 
ment O 

|pf-p?||^ < Tr^/AU, 

where A = pi — p2- Furthermore, the hound is tight, and the orthogonal measurement O that projects a 
state on the eigenvectors of pi — p2 achieves this hound. 



Theorem |2.4| shows that the density matrix captures all the accessible information that a quantum 
state contains. If two different mixtures have the same density matrix (which is indeed possible) then 
even though they are two distinct distributions, they are physically, and thus from a computational point 
of view, indistinguishable. As the behavior of a mixed state is completely characterized by its density 
matrix we often identify a mixed state with its density matrix. 

The quantity Tr \/ A is of independent interest. (Note that this is compact notation for the sum of 
the (magnitudes of the) singular values of ^4.) If we define || A \\^ = Tr V A'^ A then || • ||^ defines a norm 
(the trace norm), and has some additional properties such as ||^C?ii?||^. = - \\ A\\^^ = 1 for 

any density matrix A and || AB \\^ , || BA \\^ ^ \\ A \\^ • || B ||^. (See Q for more details.) We single out the 
following fact for later use. 

Fact 2.5 If , \ (j)2) are two pure states, and pi is the density matrix of {(pi), then 

\\P1-P2\\t = 2^1-\{<f>i\<t>2)\^. 

2.4 The fidelity measure 

A useful alternative to the trace metric as a measure of closeness of density matrices is fidelity, which 
is defined in terms of the pure states that can give rise to those density matrices. A purification of a 
mixed state p with support in a Hilbert space 7i is any pure state |(/>) in an extended Hilbert space TL^K. 
such that Tr/c \(t>){4'\ = P- Given two density matrices pi,p2 on the same Hilbert space Ti., their fidelity 
is defined as 

F{pi,P2) = sup \{(l)l\(t)2)\^ , 

where the supremum is taken over all purifications of pi in the same Hilbert space |jlO|. We state 
a few properties of this measure: < F{pi,p2) < 1, F{pi,p2) = 1 <^=^ pi = P2 and if pi = 
then we have F{pi,p2) = (</>i| P2 l^i)- Jozsa proved that the optimum is always achieved when finite 
dimensional density matrices are considered. 

Theorem 2.6 (Jozsa) Let pi,P2 he any two mixed states with support in a finite dimensional Hilbert 
space 7i, IC a Hilhert space of dimension at least dim('H), and any purification of p\ inH^K,. Then 
there exists a purification \(j)2) € W /C of p2 such that \ {(pi \<j)2)\'^ = F{pi, p2). 



Jozsa flOll also gave a simple proof (again for the finite dimensional case) of the following remarkable 



equivalence first established by Uhlmann [22 



Fipi,p2) = Tr(^/32^/pr) 



Using this equivalence, Fuchs and van de Graaf show that the fidelity and the trace measures of 
distance between density matrices are closely related. They prove: 

Theorem 2.7 (Fuchs, van de Graaf) For any two mixed states pi,p2, 



'^-yF{pi,P2) < ^\\pi-p2\\t < F{Pl,P2)- 
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2.5 Von Neumann entropy and quantum mutual information 

As mentioned earlier, the eigenvalues of a density matrix are all real, non-negative and sum up to one. 
Thus, they induce a probability distribution on the corresponding eigenvectors. Since the eigenvectors 
are all orthogonal, this is essentially a classical distribution. Every mixed state with the same density 
matrix is physically equivalent to such a canonical classical distribution. It is thus natural to define 
the entropy of a mixed state as the Shannon entropy of this distribution. Formally, the von Neumann 
entropy S{p) of a density matrix p is defined as S{p) = —J2i K^ogXi, where {Aj} is the multi-set of all 
the eigenvalues of p. More compactly, S{p) = —Trplogp. 

Not all properties of classical Shannon entropy carry over to the quantum case. For example it is quite 
possible that S{XY) < S{X) as can be seen by considering the pure state -^{\^)x \^)y + \'^)x |l)y)- 
Nonetheless, some of the classical properties do carry over, e.g., S{X) > 0, S{X) is concave and S{XY) < 
S{X) + S{Y). A property of interest to us is the following, which also generalizes a classical assertion. 

Fact 2.8 Suppose a quantum system A is in mixed state {pi, \i)}, where {\i)} are orthogonal, and ai are 
density matrices for another system B, then S(J2i Pi 10(^1 ® ^i) — ^i^) + J2i Pi S{cri). 

The density matrix corresponding to a mixed state with superpositions drawn from a Hilbert space 7i 
is said to have support in TC. A density matrix with support in a Hilbert space of dimension d, has d 
eigenvalues, hence the entropy of any such distribution is at most logd. A pure-state has zero entropy. 
Measuring a pure-state may result in a non-trivial mixture and positive entropy. In general, orthogonal 
measurements increase the entropy. For a comprehensive introduction to this concept and its properties 
see, for instance, [pO| . 

We define the "mutual information" I(X : y) of two disjoint systems X, Y in analogy with classical 
mutual information: I{X : Y) = S{X) + S{Y) — S{XY), where XY is density matrix of the system 
that includes the qubits of both systems. Again, not all properties of classical mutual information carry 
over to the quantum case. For example, it is not true in general that I{X : Y) < S{X). Nonetheless, 
some of the intuition we have about mutual information still applies. Equation (||) still holds, as follows 
immediately from the definition. Equation (|2|) also continues to be true, but its proof is much more 
involved. It is in fact equivalent to the strong sub-additivity property of von Neumann entropy. An 
important consequence of this property is that local measurements can only decrease the amount of 
mutual information. A special case of this is the classic Holevo theorem from quantum information 
theory, which bounds the amount of information we can extract from a quantum encoding of classical 
bits. 

Theorem 2.9 (Holevo) Let x ^ ax be any quantum encoding of bit strings into density matrices, let X 
be a random variable with a distribution given by Pioh{X = x) = Px, let Q be the quantum encoding 
of X according to this map, and let a = ^xPx^x- IfY is any random variable obtained by performing a 
measurement on the encoding, then 

I{X:Y) < I{X:Q) = S{a) -J^PxSiax). 

X 

In analogy with classical conditional entropy, we define S(Y\X) = J2xPxS{o'x), where X is a classical 
random variable and y is a quantum encoding of it given hy x ^ ax- We similarly define conditional von 
Neumann entropy and mutual information with respect to a classical event. Thus, for example, I{X : 
Y) = S{Y) - S{Y\X). 
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3 The average encoding theorem 



The average encoding theorem asserts that if a quantum encoding has httle correlation with the 
encoded classical information then the encoded states are essentially indistinguishable. In particular, 
they are all "close" to the average encoding. This theorem formalizes a very intuitive idea and might 
seem to be immediate from Holevo's theorem. However, there is a subtle difference: in Holevo's theorem 
one is interested in a single measurement that simultaneously distinguishes all the states, whereas in our 
case we are interested in the pairwise distinguishability of the encoded states. We first prove: 

Theorem 3.1 Let x ax be a quantum encoding mapping m bit strings x S {0, 1}™" into mixed 
states Gx- Let X be distributed uniformly over {0, 1}*" and let Q be the encoding of X according to 
this map. Denote A = 23sr Ea;i,x26{o,i}'" II ~ ^2:2 lit- ^^e"' : Q) > 1 - H{^^). 



Proof: We start with the special case of m = 1. By Theorem p.4| , there is a measurement O on Q 
that realizes the trace norm distance t = \\ cfq — cti \\^ between uo and cti. Using Bayes' strategy (see, for 
example, j^]), the resulting distributions can be distinguished with probability | + |- Let Y denote the 
classical random variable holding the result of this entire procedure. We have Prob(y = X) = ^ + |- 
Thus, by Fano's Inequality, 

L{X:Y) > l-Hi^+^-) 

We complete the proof for m = 1 by noticing that measurements can only reduce the entropy, hence I{X : 
Q) > I{X : Y), and that A = f . 

To prove the theorem for general m we reduce it to the m = 1 case. We do this by partitioning the 
set of strings into pairs with "easily" distinguishable encoding. 

Lemma 3.2 There is a set o/2'"/2 disjoint pairs {x2i~.i,X2i) which together cover {0,1}"* such that 

2 



2r. 



Proof: The expectation of the LHS over a random pairing is ^„,__j^ A; hence there is a pairing that 
achieves this A. ■ 

We now fix this pairing. Let Zi denote the set of elements in the i'th pair, i.e., Zi = {a;2i-i, a^2j} 
and Aj = || 0x21-1 ~ ^x^i ||t- know that ^ ^ Let us also denote fiS) = 1 — LLi}-^-)- From 

the base case m = 1, we know that for any i = 1, . . . , 2^/2, 1{X : Q\X ^ Z^) > /(Aj). Thus we get: 

S{Q\XeZ,)-^[S{ax2j-S{ax2,^,)] > f{A,). 
Averaging all the 2*^/2 equations yields: 

i X i 

By the concavity of the entropy function, S{Q) > ^ J2i S{Q \ X € Zi), and by definition J2x S{o'x) = 
S{Q\X). Therefore, 

I{X:Q) = S{Q)-S{Q\X) > ^Y.fi^i)- 
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Since / is convex, ^Y^if{\) > /(^Z^j^*)- Also, f{6) is monotone increasing for < 5 < ^ 
so Ei ^i) > /(^)- Together this yields I{X : Q) > /(A), as required. ■ 

Now, we can easily deduce Theorem 
Proof of Theorem 1.2: Let A' = J2xi II — c lit- We have: 



2' 



A' 



om ^ ^ II '''^1 



(7 



a;2 



< — y 



a. 



X2 lit 



< A 



X1,X2 



By Theorem O, /(X : Q) > 1-H{^), and by Fact O we have 1 - > l-(l-(4)2) = ^. 



Thus, A' < A < 2^I{X : Q). ■ 

4 Local transition between bipartite states 



Lo and Chau |13] and Mayers [14| proved: 



Theorem 4.1 (Lo and Chau; Mayers) Suppose and |02) fl'^e two pure states in the Hilbert space 
TL^K,, such that Tr^: |02)(02| = Tr/c |</>i)((/>i|, i.e., the reduced density matrix of \(j)2) to TL is the same 
as the reduced density matrix of to 7i. Then, there is a local unitary transformation U on fC such 
that Kg) U \(l)2) = \ct>i). 



The theorem follows by examining the Schmidt decomposition [20| of the two states. 

A natural generalization of this is to the case where the reduced density matrices are close to each 



other but not quite the same, which is what appears in Theorem |L^. Lo and Chau |13] and Mayers |14] 



considered this case as well. Theorem L3 follows from their work by using the newer results of M stated 



in Theorem 2.7 



Proof of Theorem |1.3| : By Theorem 2^, there exists a purification |02) G TL ® K, of p2 such 



that |(<Ai 1^^2)1 = F{pi:P2)- Since \(f)2) and l^g) have the same reduced density matrix in by The- 
orem LI, there is a (local) unitary transformation [/ on /C such that / ® U\(j)2) = W2)- Moreover, by 
Fact we have 



||l'Al)(</'l|-k2)('/'2|||t = 2V'1-|(</>1|0'2>|' 

By Theorem |2]^, y/F{pi,p2) > 1 - ^ || pi - /02 \\t, so 



2Jl-F(pi,p2) 



1-Fipi,p2) < 1 



1-^11.1 



P2 



< 



\Pl- P2 



This, when combined with the earlier bound on the trace distance between \(j)i) , \(f)'2) gives us the required 
result. ■ 

5 The role of interaction in quantum communication 

In this section, we prove that allowing more interaction between two players in a quantum com- 
munication game can substantially reduce the amount of communication required. We first define a 
communication problem and state our results formally (giving an overview of the proof), and then give 
the details of the proofs. 
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5.1 The communication problem and its complexity 



In this section, we give the main components of the proof of Theorem 1.1. We define a sequence of 
problems 5o, Si, ... , Sk, • • • by induction. The problem Si is the index function, i.e., Alice has a n-bit 
string X £ Xi = {0, 1}", Bob has an index i G yi = [n] and the desired output is Si{x,i) = Xi. Suppose 
we have already defined the function S/^-i '■ ^^fc-i x y^^i — > {0, 1}. In the problem S/., Alice has as input 
her part of n independent instances of Sk-i, i.e., x G ^k-i^ Bob has his share of n independent instances 
of Sk-i, i.e., y G yj^-i, and in addition, there is an extra input a S [n] which is given to Alice if k is even 
and to Bob if k is odd. The output we seek is the solution to the ath instance of S/^-i- In other words, 

Sk{xi,. . . ,Xn,a,yi,. . . ,yn) = Sk-l{Xa,ya)- 

Note that the input size to the problem Sk is N = 0(n^). If we allow k message exchanges for solving 
the problem, it can be solved by exchanging 0(logA^) = ©(/clogn) bits: for A; = 1, Bob sends Alice 
the index i and Alice then knows the answer; for k > 1, the player with the index a sends it to the 
other player and then they recursively solve for Sk-i{xa,ya)- However, we show that if we allow one 
less message, then no quantum protocol can compute Sk as efficiently. In fact, no quantum protocol can 
compute the function as efficiently even if we require small probability of error only on average. 

Theorem 5.1 For all constant A: > 1, < e < i, Q^^^(S'fc+i) > n (AfV^^+i)^ . 

In fact, we prove a stronger intermediate claim. Let Pi be Alice, and for k > 2, let denote the player 
that holds the index a in an instance of Sk {a indicates which of the n instances of Sk-i to solve). Let Pk 
denote the other player. We refer to Pk as the "wrong" player to start a protocol for Sk. The stronger 
claim is that any k message protocol for Sk in which the wrong player starts is exponentially inefficient 
as compared to the log N protocol described above. 

Theorem 5.2 For a// constant A: > 1, < e < i, Q^'f'=(S'fc) > (iV^/*^ 

In fact, there is a classical A;- message protocol in which the wrong player starts with complexity 0{n) 
so our lower bound is optimal. 



Theorem 5.1 now follows directly. 



Proof of Theorem |5.1| : It is enough to show the lower bound for the two cases when the protocol 
starts either with Pk+i or with the other player. 

Let Pk+i be the player to start. Note that if we set a to a fixed value, say 1, then we get an instance 



of Sk. So Q^f +'(5fc) < Qf;_f'=+^(5fc+i). But Pk+i = Pk, so the bound of Theorem ^ applies. 

Let player Pk+i be the one to start. Then, observe that if we allow one more message (i.e.. A: + 1 

messages in all), the complexity of the problem only decreases: Q^^^''^*^^ (5fc+i) < q'I}^''^^ (Sk+i). So we 
again get the same bound from Theorem |5.2| . ■ 



We prove Theorem 5.2 by induction. First, we show that the index function is hard to solve with one 



message if the wrong player starts. This essentially follows from the lower bound for random access codes 



in |16|. The only difference is that we seek a lower bound for a protocol that has low error probability 
on average rather than in the worst case, so we need a refinement of the original argument. We give this 
in the next section. 



Lemma 5.3 For any 0<e<l, Qu^{Si) > (1 - H{e))n. 



Next, we show that if we can solve Sk with k messages with the wrong player starting, then we can 
also solve Sk-i with only A; — 1 messages of almost the same total length, again with the wrong player 
starting, at the cost of a slight increase in the average probability of error. 
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Lemma 5.4 For all k > 2, Q < e < \, Q^J' "-^^Sk-i) < 1 + logn, where £ = Qljy{Sk), and e' 
e + 4(^/n)i/l 



We defer the proof of this lemma to a later section, but show how it implies Theorem p.2| above. 



Proof of Theorem 5.2: We prove the theorem by induction on k. The case A; = 1 is handled by 



Lemma 5.3. Suppose the theorem holds for k — 1. We prove by contradiction that it holds for k as well. 



If Qu^'^iSk) = o(n), then by Lemma 5A there is a A; — 1 message protocol for Sk~i with the wrong 
player starting, with error e' = e + o(l) < ^, and with the same communication complexity o(n). This 
contradicts the induction hypothesis. ■ 

5.2 Hardness of the index function 

We now prove the average case hardness of the index function. 



Proof of Lemma 5.3: Let Q denote the message sent by Alice. For a prefix y £ {0, 1}* of 
length i > 0, let Qy be the encoding which is prepared by first fixing xi = yi, . . . ,Xi = yi and then 
choosing Xj+i, . . . , Xm at random and sending the state ax- Its density matrix is given by 



2m— i 



m-i ^ y^- 

1 m — i 



On the one hand, /(Q • ^) — ^i ths number of qubits in Q. On the other hand, for y G {0, 1}"', let ey be 
the error probability when xi = yi, I < j , and the index i = j + 1. Note that e = ^ Sj=o ^ "^yeio ly 
Moreover, we have I{Qy : ^j+i) > 1 — H{ey), since Bob has a measurement that predicts Xj^i with 
probability 1 — given Qy. We now claim that 

Lemma 5.5 ^ Ex ET=o^ HQx^-x, : ^i+i) < HQ : X). 
By this lemma, 

n—l 

I{Q:X) > E HQy --Xj+i) > a-Hie))n, 

i=0 ye{0,iy 



using the concavity of the entropy function. 



Proof of Lemma |5.5| : By the definition of mutual information, and using Fact 2.8, 
I{QXi • • • X, : Xi+i) = S{QXi ■■■Xi) + S{X,+i) - S{QXi • • • 

= [^ + ^ E Siay)] + [l]-[{i + l) + ^ ^K)] 
ye{o,ir ye{o,i}"+i 

= ^ E [S{c^y)-liSM + S{ay,))] 

= ^ E my--x.+i)- 

Moreover, from Properties (Q) and (^), 

m—l 

HQ-.X) > ^ /(Q,Xi,...,X, 



i=0 
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m—1 -, 

»n— 1 

= E ^ E ^(Qyl■■■y^ ■ ^i+i)' 

i=0 ye{0,l}'" 

which proves the claim. ■ 



5.3 The reduction step 



In this section, we show how an efficient protocol for Sk gives rise to an efficient protocol for Sk-i- 



The gross structure of the argument is the same as in [15, 12 1. However, we use entirely new techniques 
from quantum information theory, as developed in Section ^ and ^ and also get better bounds in the 
process. 



Proof of Lemma 5.4: For concreteness, we assume that k is even, so that Pk is Bob. Let V he a. 
protocol that solves Sk with respect to U with £ message qubits, error e, and k messages starting with 
Bob. We would like to concentrate on inputs where a is fixed to a particular value in [n]. This would give 
rise to an instance of Sk-i that is also solved by V, but with k messages. An easy argument shows the 
first message carries almost no information about ya, and we would like to argue that it is not relevant 
for solving S^-i- However, the correctness of the protocol relies on the message, so we try to reconstruct 
the message with Alice starting the protocol instead. We give the details below. 

We first derive a protocol V' which has low error on an input for Sk generated as below (we call the 
resulting distribution Ua=j)'- chosen uniformly at random from Xk-i, cl is set to j, yj is 

chosen uniformly at random from 3^fc_i, and for all i ^ j, register Yi is initialized to the state X^zg^j. ^ \^) 
(normalized). 

Let ej denote the error of V with respect to the distribution Ua=j- Note that ^^i^i < e, since having 
the in a uniform superposition over all possible inputs has the same effect on the result of the protocol 
as having it randomly distributed over the inputs (recall that we require that the input registers are not 
changed during a quantum protocol). Let be the mutual information I{M : Yj) in the protocol V 
when run on the mixed state Ua=j with yj being chosen randomly. 

Lemma 5.6 There is a protocol V which solves Sk with respect to the distribution Ua=j with error 6j = 
Ej + ^IJ^j^ error, i message qubits and k rounds starting with Bob, such that I{M : Yj) = 0. 

The protocol V is obtained by slightly modifying the first message in protocol V so that it is completely 
independent of Yj. This only affects the average probability of error. Moreover, in V' the first message 
does not carry any information about yj and is therefore completely independent of it. Intuitively this 
means that Alice does not need to get that message at all, or equivalently that she can recreate it herself. 
This gives a protocol for solving Sk-i{xj,yj) with k — 1 messages and with Alice starting. 

Lemma 5.7 There is a protocol V" that solves Sk-i with respect to U with e' error, £ + logn message 
qubits and k — 1 messages starting with Alice. 

Together we get Q'^ J'^{Sk-i) < i + logn as claimed. ■ 
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5.4 Proof of Lemmas 5.6 and 5.7 



Proof of Lemma |5.q : First consider the case when Yj is fixed to some z, but the rest of the 
inputs are as in Ua=j- In protocol V Bob apphes a unitary transformation V on his qubits and 
computes \(f>{z)) = V \0,Yi, . . . ,Yn) in register M (for the message) and B (for Bob's ancilla and in- 
put). In V' the message computation is shghtly different. Instead of computing \(f){z)), Bob com- 



putes 



^10,^1, 



I Yj_i_i, . . . , y„), where is the uniform superposition over yk-i- 



Clearly, in V' the state and hence the message M does not depend on yj = z, hence I{M : Yj) = 
when Uj is chosen randomly. 

Let us denote by pm{z) the reduced density matrix of the message register M va.V when the input is 
drawn according to Ua=j and yj = z, and let the corresponding density matrix for V' be pM- Clearly, 
P'^^ ^ Wk~i\ S^e^fc-i Pm{z). Let = \\pM - Pm{z) ||f By Theorem |]| we know that E^tz < 

Protocol V generates the pure state while the desired pure state is \(l){z)). Bob, who knows yj = z 
knows both \(l){z)) and By Theorem |l . 3| there is a local unitary transformation acting on register B 
alone, such that 

\\\Tzcl)'){Tzqy'\-\cP{z)){cP{z)\l < 2^,. 

The next step in protocol V' is that Bob applies the transformation to his register B. After that, 
protocol V' proceeds exactly as in V. Therefore, for a given z, the probability that V and V' disagree 
on the result is at most 2-y/t7, and the error probability of V on Ua=j is at most 

5j = ej + 2E,^, < + < ej + Apy\ 

where the second step follows from Jensen's inequality. ■ 

Proof of Lemma 5.7: Protocol V" solves an instance of Sk-i- Alice is given an input x £r Xk-i 



and Bob is given an input y yk-i- The protocol proceeds as follows. Alice and Bob first reduce 
the problem to an Sk instance taken from the distribution Ua=j for a random j. To do that, Alice 
picks j £ [n] at random, sets a = j and sends it to Bob; Alice sets Xj = x and Bob sets yj = y; Alice 
picks Xi Ejj Xk-i for i ^ j; and Bob initializes each register for i ^ j with J^z&y^-i 1^) (normalized). 

Notice that if Alice and Bob run the protocol V' over this input, then they get the answer Sk-i{x,y) 
with probability of error at most 



-I ri -| li -I ji 



< e + 4 



n 



i=l 



n 



i=l 



n 



i=l 



n 



Pi 



i=l 



We claim that 



Claim 5.8 J2iPi ^ ^i? where ii is the length of the message M. 

Hence e' < e + 4(£/n)i/4. 

Alice and Bob do not run the protocol V' itself, but a modification of it in which Alice sends the first 
message instead of Bob, thus reducing the number of rounds to k — 1. 

Let pm be the reduced density matrix of register M holding the first message that Bob sends to 
Alice in V' , for the input given above. By Lemma 5.6, we know that pM does not depend on yj = y. 



So Pm is known in advance to Alice. Alice starts the protocol P" by purifying pM- More specifically, 
let {|ei)} be an eigenvector basis for pM with real and positive eigenvalues Aj. Alice constructs the 
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superposition J2i V^l&i, i) mb registers M (containing the eigenvectors) and B (containing the 

index i), and sends register B to Bob. The state of the system after this message in V" is 

i 

whereas in V' it is 

\x{y)) = \xi,...,Xn)A®\Ty(t)')MB- 

The reduced density matrix of \S) to registers AM is the same as the reduced density matrix of \x{y)) to 
registers AM. By Theorem [4.1| , Bob has a local unitary transformation Vy (operating on his register B) 
that transforms |^) to \x{y))- Bob apphes Vy, and AHce and Bob then simulate the rest of the protocol V' . 
From this stage on, the runs of the protocols V' and V" are identical have the same communication 
complexity and success probability. ■ 

Proof of Claim |5.8| : Note that /ij is the same as the mutual information I{M : Yj) when V is run on 
the uniform distribution on x 3^^_i. So we prove the claim for the latter. 

For any i, I{Yi : Yi • • • Yi-iYi+i • • • y„) = 0. Therefore by Properties (||) and d) (cf. Section 1) we have 

n n 

I{M -.Yi-'-Y^) > Y.I{MY,---Yi_i:Yi) > Y.I{M -.Y.^) = J^n^ 

i=l i=l i 

As the first message M contains only ii qubits, we have J2i l^i ^ -^(Af : Yi • • • Yn) < ^i. ■ 
Acknowledgements 

We thank Jaikumar Radhakrishnan and Venkatesh Srinivasan for their input on the classical com- 
munication complexity of the pointer jumping and the subproblem we study in this paper, and Dorit 
Aharonov for helpful comments. 

References 

[1] D. Aharonov, A. Kitaev, and N. Nisan. Quantum circuits with mixed states. In Proceedings of the 
Thirtieth Annual ACM Symposium on Theory of Computing, pages 20-30, New York, May 23-26, 
1998. ACM Press. 

[2] A. Ambainis, L. J. Schulman, A. Ta-Shma, U. Vazirani, and A. Wigderson. The quantum com- 
munication complexity of sampling. In Proceedings of the 39th Annual Symposium on Foundations 
of Computer Science, pages 342-351, Los Alamitos, CA, November 8-11, 1998. IEEE Computer 
Society Press. 

[3] Ethan Bernstein and Umesh Vazirani. Quantum complexity theory. SIAM Journal on Computing, 
26(5):1411-1473, October 1997. 

[4] H. Buhrman, R. Cleve, and A. Wigderson. Quantum vs. classical communication and computation. 
In Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, New York, NY, 
May 23-26, 1998. ACM Press. 

[5] Harry Buhrman and Ronald de Wolf. Communication complexity lower bounds by polynomials. 
Technical report, LANL CS archive, http://www.lanl.gov/abs/cs/9910010, 1999. 



13 



[6] Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. Wiley Series in Telecom- 
munications. John Wiley & Sons, New York, NY, USA, 1991. 

[7] Pavol Duris, Zvi Galil, and Gcorg Schnitger. Lower bounds on communication complexity. Infor- 
mation and Computation, 73(l):l-22, April 1987. 

[8] Christopher A. Fuchs and Jcroen van de Graaf. Cryptographic distinguishability measures for 
quantum-mechanical states. IEEE Transactions on Information Theory, 45(4):1216-1227, May 1999. 

[9] A. Holevo. Bounds for the quantity of information transmitted by a quantum communication chan- 
nel. Problemy Peredachi Informatsii, 9(3):3-ll, 1973. English translation in Problems of Information 
Transmission, volume 9, 1973, pages 177-183. 

[10] R. Jozsa. Fidelity for mixed quantum states. Journal of Modern Optics, 41(12):2315-2323, 1994. 

[11] Hartmut Klauck. On quantum and probabilistic communication: Las vegas and one-way protocols. 
In Proceedings of the Thirty-second Annual ACM Symposium on Theory of Computing, Portland, 
OR, May 21-23, 2000. ACM Press. 

[12] E. Kushilevitz and N. Nisan. Communication Complexity. Cambridge University Press, 1997. 

[13] H. Lo and H. Chaii. Why quantum bit commitment and ideal quantum coin tossing are impossible. 
Physica D, 120:177-187, 1998. See also quant-ph/9711065. 

[14] D. Mayers. Unconditionally secure quantum bit commitment is impossible. Phys. Rev. Lett., 
78:3414-3417, 1997. 

[15] Peter Bro Miltersen, Noam Nisan, ShmucI Safra, and Avi Wigderson. On data structures and asym- 
metric communication complexity. In Proceedings of the Twenty- Seventh Annual ACM Symposium 
on the Theory of Computing, pages 103-111, Las Vegas, Nevada, May 29-June 1, 1995. 

[16] A. Nayak. Optimal lower bounds for quantum automata and random access codes. In Proceedings of 
the 40th Annual Symposium on Foundations of Computer Science, pages 369-376, New York, NY, 
October 17-19, 1999. IEEE Computer Society Press. 

[17] Noam Nisan and Avi Wigderson. Rounds in communication complexity revisited. SIAM Journal 
on Computing, 22(1):211-219, February 1993. 

[18] Christos H. Papadimitriou and Michael Sipser. Communication complexity. In Proceedings of the 
Fourteenth Annual ACM Symposium on Theory of Computing, pages 196-200, San Francisco, Cali- 
fornia, May 5-7, 1982. 

[19] Stephen J. Ponzio, Jaikumar Radhakrishnan, and S. Venkatesh. The communication complexity of 
pointer chasing, applications of entropy and sampling. In Proceedings of the Thirty-First Annual 
ACM Symposium on Theory of Computing, pages 602-611, New York, May 1-4 1999. ACM Press. 

[20] J. Preskill. Lecture notes, http://www.theory.caltech.edu/people/preskill/ph229/. 

[21] Ran Raz. Exponential separation of quantum and classical communication complexity. In Proceed- 
ings of the Thirty-First Annual ACM Symposium on Theory of Computing, pages 358-367, Atlanta, 
OA, May 1-4, 1999. ACM Press. 



14 



[22] A. Uhlmann. The 'transition probability' in the state space of a *-algebra. Reports on Mathematical 
Physics, 9:273-279, 1976. 

[23] Andrew Chi-Chih Yao. Quantum circuit complexity. In Proceedings of the 34th Annual Symposium 
on Foundations of Computer Science, pages 352-361, Palo Alto, CA, November 3-5, 1993. IEEE 
Computer Society Press. 



15 



